AMENDMENTS TO THE CLAIMS 

1 . (Currently Amended) A method comprising: 

interpreting user input received at a client mobile device from a user, the 

interpreting including identifying a selection of at least one of a plurality 
of web interaction modes, each of the plurality of web interaction modes 
to perform interpretation of content on a server computer system and a 
client mobile device coupled with the server computer system, the 
plurality of web interaction modes including a focus mechanism; 

identifying, via the focus mechanism, an active display element associated with 
and applying the user input to the active display element , and focusing the 
client mobile device on the active display element; 

transmitting the active display element to the server computer system such that 

real-time speech recognition is performed based on synchronization of the 
active display element with one or more speech elements of speech, the 
speech recognition to reduce speech computing load and eliminate speech 
dictation; and 

dynamically correcting grammar using the real-time speech recognition based on 
the synchronization of the active display element and the one or more 
speech elements. 



computer syt 



jpeech identifier of the identified 



focused display element; and 



recognizing speech associated with the 



input based on evaluating 



corresponding relationship between the display element and a speech 



element of the speech. 
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Claims 2-3 (Cancelled) 

4. (Previously Presented) The method as claimed in Claim 1 wherein the focused 
display element comprises a hyperlink or a field in a form. 

5. (Cancelled) 

6. (Previously Presented) The method as claimed in Claim 1 further including: 
extracting speech features from the user input, and generating a client mobile 

device request based in part on the extracted speech features. 

7. (Cancelled) 

8. (Previously Presented) The method as claimed in Claim 1 further including: 
receiving a session message at the server computer system to initialize a 
connection between the server computer system and the client mobile device, 
wherein the session message includes an internet protocol (IP) address of the 
client mobile device, a device type of the client mobile device, a voice character 
of a user responsible for the user input, a language of the user input, and a default 
recognition accuracy requested by the client mobile device. 

9. (Cancelled) 

10. (Previously Presented) The method as claimed in Claim 1 further including: 
receiving a transmission message at the server to exchange transmission 

parameters between the server computer system and the client mobile 
device. 
Claims 11-13 (Cancelled) 

14. (Previously Presented) The method as claimed in Claim 1 further including: 

receiving an exit message at the server computer system to terminate a user 

session with the server computer system and the client mobile device. 
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Claims 15-34 (Cancelled) 

35. (Currently Amended) A machine-readable medium having instructions which 
when executed cause a machine to: 

interpret user input received at a client mobile device from a user, the interpreting 
including identifying a selection of at least one of a plurality of web 
interaction modes, each of the plurality of web interaction modes to 
perform interpretation of content on a server computer system and a client 
mobile device coupled with the server computer system, the plurality of 
web interaction modes including a focus mechanism; 

identify, via the focus mechanism, an active display element associated with an 
applying the user input to the active display element , and focusing the 
client mobile device on the active display element the user input; 

transmit the active display element to the server computer system such that real- 
time speech recognition is performed based on synchronization of the 
active display element with one or more speech elements of speech, the 
speech recognition to reduce speech computing load and eliminate speech 
dictation; and 

dynamically correct grammar using the real-time speech recognition based on the 
synchronization of the active display element and the one or more speech 
elements. 

receive, at the server computer system, a speech identifier of the identified 
focused display element; and 

Attorney Docket No. 42P14283 4 
Application No. 10/534,661 



recognize speech associated with the user input based on evaluating a 

corresponding relationship between the display element and a speech 
element of the speech. 

36. (Cancelled) 

37. (Cancelled) 

38. (Previously Presented) The machine-readable medium as claimed in Claim 35 
wherein the focused display element is a hyperlink or a field in a form. 

Claims 39-44 (Cancelled) 

45. (Currently Amended) A system comprising: 

a server computer system coupled with a client mobile device, the server 
computer system to 

interpret user input received at a client mobile device from a user, the interpreting 
including identifying a selection of at least one of a plurality of web 
interaction modes, each of the plurality of web interaction modes to 
perform interpretation of content on a server computer system and a client 
mobile device coupled with the server computer system, the plurality of 
web interaction modes including a focus mechanism; 

identify, via the focus mechanism, an active display element associated with and 
applying the user input to the active display element , and focusing the 
client mobile device on the active display element; 

transmit the active display element to the server computer system such that real- 
time speech recognition is performed based on synchronization of the 
active display element with one or more speech elements of speech, the 
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speech recognition to reduce speech computing load and eliminate speech 
dictation; and 

dynamically correct grammar using the real-time speech recognition based on the 
synchronization of the active display element and the one or more speech 
elements. 

receive, at the server computer system, a speech identifier of the identified 

focused display element; and 
recognize speech associated with the user input based on evaluating a 

corresponding relationship between the display element and a speech 

element of the speech. 

46. (Previously Presented)The system as claimed in Claim 45 wherein the server 
computer system is further to: 

extract speech features from the user input, and generate a client mobile device 
request based in part on the extracted speech features. 

47. (Previously Presented) The system as claimed in Claim 45 wherein the server 
computer system is further to: 

receive a session message at the server computer system to initialize a connection 
between the server computer system and the client mobile device, wherein the 
session message includes an internet protocol (IP) address of the client mobile 
device, a device type of the client mobile device, a voice character of a user 
responsible for the user input, a language of the user input, and a default 
recognition accuracy requested by the client mobile device. 

48. (Previously Presented) The system as claimed in Claim 45 wherein the server 

computer system is further to: 
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receive a transmission message at the server to exchange transmission parameters 
between the server computer system and the client mobile device. 
49. (Previously Presented) The method as claimed in Claim 45 wherein the server 
computer system is further to: 

receive an exit message at the server computer system to terminate a user session 
with the server computer system and the client mobile device. 
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